rank | frequency | n-gram |
---|---|---|
1 | 10511 | -а |
2 | 6375 | -е |
3 | 6105 | -и |
4 | 3798 | -у |
5 | 3234 | -м |
rank | frequency | n-gram |
---|---|---|
1 | 1827 | -ом |
2 | 1649 | -на |
3 | 1240 | -ма |
4 | 1226 | -не |
5 | 1200 | -ни |
rank | frequency | n-gram |
---|---|---|
1 | 727 | -ија |
2 | 688 | -има |
3 | 684 | -ије |
4 | 508 | -ног |
5 | 500 | -них |
rank | frequency | n-gram |
---|---|---|
1 | 305 | -ског |
2 | 225 | -ције |
3 | 216 | -ција |
4 | 204 | -ости |
5 | 203 | -ском |
rank | frequency | n-gram |
---|---|---|
1 | 160 | -ности |
2 | 140 | -ација |
3 | 133 | -ације |
4 | 92 | -нског |
5 | 87 | -ајући |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings